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A  TRANSFORMATION  YIELDING  AN  ADDITIVE  REPRESENTATION 
OF  DATA  IN  A  TWO-WAY  ARRAY 


James  A.  Paulson 
Psychology  Department 
Portland  State  University 

it  is  common  practice  when  seeking  an  additive  representation  of  data  in 
a  two-way  array  to  try  out  various  transformations  using  Tukey’s  single 
degree  of  freedom  for  nonadditivity  as  an  index  of  the  extent  of  the  deviation 
of  the  reexpressed  data  from  additivity.  It  is  shown  that  when  data  fit 
Tukey's  model  and  an  additive  representation  exists  in  the  sense  defined  by 
Luce  and  Tukey  (1964),  the  transformation  required  to  obtain  the  additive 
representation  is 

fCy)  =  log  (xy  +  1  -  Xy)  , 

where  X  is  the  weight  for  the  degree  of  freedom  for  nonadditivity.  The 
transformation  is  unique  up  to  a  linear  transformation.  This  result  is  in 
apparent  conflict  with  power  transformations  suggested  by  Anscombe  and  Tukey 

(1963).  1  Accer.fiior  For  j 


IntrcxJuction 


Suppose  the  structure  of  espected  values,  (Pij),  of  a  two-way  array 
of  random  variables,  (yj^j)»  is  given  by  Tukey's  (1949)  nodel  with  a  single 
degree  of  freedom  for  nonadditivity. 


Pij  =  u  +  aj^  +  6j  +  . 


The  following  linear  transfer nation  yields  a  multiplicative  representation 
of  the  array. 

Au^j  -  Au  +  1  =  (1  +  Aa^)(l  +  Xgj)  .  (2) 

If  1  +  Auj^  and  1  +  X6j  are  positive  for  all  i,  j,  then  we  nay  take  lo^- 
rithms  on  both  sides  to  obtain  the  additive  representation 

log  (Apj^j  -  Xu  +  1)  =  log  (1  +  \aj)  +  log  (1  +  A3j)  .  (3) 


An  additive  representation  of  an  array  (Pij)  exists  in  the  sense 
defined  by  Luce  and  Tukey  (1964)  if  there  is  a  transformation  of  the  Uj^j's  , 
f(uij)  ,  and  functions  of  the  row  and  colunn  indices,  r(i)  and  c(j)  ,  such 


f(Uij)  =  r(i)  +  c(j) 


for  all  i,  j  and 


f(Uij)  ^  f(Ukl) 


if  and  only  if 


‘'ij  -  ^'kl 


for  all  i,  j,  k,  1. 
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Luce  and  Tukey  show  that  if  such  a  transfornation  exists,  then  it 
is  unique  up  to  a  linear  transfornation,  as  are  the  functions  r( i)  and  c(j). 
Equation  3  shows  that  such  a  transfornation  exists  for  data  described  by 
Tukey 's  rnodel,  provided  1  +  Xa^  and  1  +  X3j  are  positive  for  all  i,  j.  If 
either  ?  +  Xa^^  or  1  +  XBj  are  nor^sitive  for  any  i  or  j,  then  no  additive 
representation  exists.  To  see  this,  note  that  it  follows  irniediately  from 
Luce  and  Tukey' s  definition  of  an  additive  representation  that  the  array 
(Uj^j)  has  such  a  representation  only  if  the  order  of  elenents  in  any  row 
is  determined  by  the  colunn  function,  c(j),  and  the  order  of  elenents  in 
any  colunn  is  determined  by  the  row  function,  r(i).  For  ex'nple,  for 
elements  in  row  i  it  follows  from  Equation  4  that 

Uij  -  Pik  -  0 

if  and  only  if 

f(Uij)  -  f(yik)  =  c(j)  -  c(k)  >  0  . 

Equation  1  can  be  rewritten  to  show  that  the  elenents  in  row  i  are  a  linear 
function  of  6j,  with  slope  1  +  Xaj^  ; 

=  P  +  +  (1  +  Xa^)6j  .  (5) 

If  1  +  Xa^  is  ne^tive,  the  order  of  the  p^j’s  in  row  i  is  the  reverse  of 
the  order  of  the  6j's.  But  in  the  additive  representation  given  in 
Equation  3,  c(j)  =  log  (1  +  XBj)  ,  so  the  order  of  the  c(j)'s  is  the  sane 
as  the  order  of  the  Bj's.  Thus,  the  order  of  the  Pij's  nust  differ  from 
the  order  of  c( j) 's,  contradicting  the  possibility  of  an  additive  represen¬ 
tation. 

The  transfornation  log  ( Xy  -  Xy  +  1) ,  just  shown  to  be  required  to 
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obtain  an  additive  representation  if  Tukey's  nodel  holds,  has  not  been 
sug^sted  before  in  the  statistical  literature,  to  the  author's  knowled^. 
Transf or  nations  of  the  form  log  (y  +  c)  are  mentioned  in  passing  by  Tukey 
(1949) ,  Ansconbe  and  Tukey  (1963) ,  and  Box  and  Cox  (1964) ,  but  the  neces¬ 
sary  connection  between  the  paraneters  of  Tukey's  nodel  and  the  value  of 
c  is  not  discussed.  In  fact,  Ansconbe  and  Tukey  (1963)  propose  a  different 
transformation  based  on  the  parameters  of  Tukey's  nodel — the  power  trans- 
for  nation  .  In  the  case  l-Ap  =  0  ,  they  suggest  the  transfer  nation 

log  y,  which  is  the  only  case  where  their  reconrendation  cc'ncides  with  the 
transformation  inplied  by  the  nodel.  Kruskal  (1965)  and  de  Leeuw  et  al.  (1976) 
have  developed  ^neral  numerical  algarithms  to  find  the  nonotone  transforma¬ 
tion  that  will  render  a  given  two-way  array  nose  additive,  assuming  only 
that  such  a  transfer nation  exists.  It  is  beyond  the  scope  of  this  paper  to 
discuss  the  comparative  virtues  of  ti.a  different  approaches  in  detail,  but 
there  are  a  few  points  that  should  be  mentioned.  This  ni^t  best  be  done  in 
the  context  of  m  ejanple. 


Example 

Figures  la  and  lb  are  two  sli^tly  different  ways  of  plotting  data 
from  an  ejperinent  of  Carterette  and  Anderson  (1979)  on  the  scaling  of 
loudness,  using  the  bisection  method.  This  method  has  the  subject  adjust 
a  sound  to  the  point  where  its  loudness  is  halfway  between  the  loudnesses 
of  two  given  sounds.  Carterette  and  Anderson  propose  a  simple  algebraic 
theory  which  implies  that  the  subjective  loudness  of  the  sounds  which  the 
subjects  give  as  their  besecticai  responses  should  be  an  additive  function  of 
the  loudnesses  of  the  sounds  being  bisected.  A  problem  that  arises  in  eval¬ 
uating  this  theory  empirically  is  that  the  subject's  subjective  sensation 
is  an  unknown  function  of  the  physical  sound  intensity  expressed  in  decibels. 
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Carterette  and  Anderson  circuiwent  this  problem  by  applying  the  nunerical 
algarithm  of  de  Leeuw,  et  al.  (1976)  nentioned  above  to  obtain  a  nonotone 
transfor nation  which  renders  their  data  as  additive  as  possible.  They 
then  evaluate  how  well  the  transforned  bisection  response  data  fit  the 
predictions  of  their  theory.  We  shall  use  the  data  of  the  Carterette  and 
Anderson  to  study  the  trainsfor nation  produced  by  the  de  Leeuw  et  al. 
algDrithnv  the  transfor nation  iiplicit  in  Tukey's  nodel,  and  the  power 
transformation  suggested  by  Ansoonbe  and  Tukey. 

Figure  la  is  a  staridard  plot  relating  the  raw  bisection  responses  to 
the  intensities  of  the  first  and  second  sounds  in  a  pair  to  be  bisected. 

Each  point  represents  the  nean  of  seventy  observations  nade  up  of  ten 
replications  for  each  of  seven  subjects.  The  colurms  are  equally  spaced. 
Figure  lb  presents  the  sane  data,  e«ept  the  colunns  are  spaced  according 
to  their  neans.  Spacing  the  oolunns  according  to  their  neans  provides  a 
graphical  nethod  for  determining  whether  or  not  Tukey's  nodel  fits  the 
data.  Equation  5  inplies  that  the  rows  should  plot  as  strai^t  lines  if 
Tukey's  nodel  fits.  If  the  data  are  additive,  the  strai^t  lines  should 
be  parallel.  If  not,  they  should  neet  at  a  oomnon  focal  point.  The  inter¬ 
section  of  the  lines  obtained  from  Equation  5  for  any  two  rows  is  at 
6j  =  -l/X  .  The  focal  point  of  the  lines  representing  the  rows  is  therefore 

y  =  Vi  +  Oi  +  (1  +  Aa-){-l/A)  (5) 

=  M  -  1/X  . 

Note  that  the  terms  in  the  nultiplicative  representation  given  in  Equation  2 
can  be  interpreted  as  deviations  from  this  focal  value,  eis  follows: 


Second  Intensity  (dB) 


Raw  loudness  bisection  responses,  adapted  from  data  of 
Carterette  and  Anderson  (1979),  plotted  two  different  ways 


t 


Transformed 

Response 

(Arbitrary 

units) 


Figure  2.  Transformations  of  raw  bisection  responses  to  attain  additivity. 
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Xuij  -  Xp  +  1  =  X(yj^j  -  y  +  1/X)  ,  (6) 

1  +  Xa^  =  ^(Pi.  -  P  +  1/^)  » 

1  +  X6j  =  X(y.j  -  y  +  1/X)  . 

Further irore,  the  reciprocal  of  X  is  the  overall  nean  of  the  deviations  from 
the  focal  value.  That  is,  Tukey's  nodel  essentially  says  that  the  ejpected 
deviation  of  an  observation  from  the  focal  value  is  equal  to  the  product  of 
the  mean  deviation  in  that  row,  tines  the  ratio  of  the  mean  colunn  deviation 
to  tiie  overall  nean  deviation.  (The  role  of  rows  and  colunns  can  obviously 
be  switched.) 

The  data  in  Figure  lb  fit  Tukey's  ncx3el  fairly  well.  The  lines  are 
reasonably  strai^t  and  seem  like  they  ni^t  be  converging  to  a  focal  point. 
For  this  data  y  =  72.3  ,  X  =  -.027  ,  and  y  -  1/X  =  109.3  .  The  estinated 
focal  value  has  a  substantive  interpretation.  According  to  Licklider  (1951), 
110-115  (£  is  about  the  point  where  sound  begins  to  be  too  intense  for 
comfort,  for  a  person  who  has  not  adapted  to  listening  to  loud  sounds. 

Sounds  this  loud  give  rise  to  nonauditory  sensations  such  as  tickle  and  pain. 

Figure  2  shows  the  effect  of  three  transformations  on  the  raw  bisection 
response  data;  the  ADDAI5  transformation  of  de  Leeuw,  et  al.  (1976)  employed 
by  Carterette  and  Anderson,  the  power  transformation  suggested  by  Ansoombe 
and  Tukey  (1963),  and  the  transformation  proposed  in  this  paper.  The  latter 
two  transformations  were  further  transformed  by  linear  regression  to  obtain 
comparability  with  the  ADCAI£  transformation  values.  This  is  legitimate 
because  linear  transformation  has  no  effect  on  additivity.  The  zero  and 
unit  of  the  ADDALS  transformation,  being  arbitrary,  were  selected  by 
Carterette  and  Anderson  for  convenience. 
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Carterette  and  Anderson  conclude  that,  on  the  whole,  the  ADDALS  trans- 
fornation  succeeds  in  rendering  the  data  additive,  thou^  they  note  sone  sli^t 
but  statistically  sigiificant  nonadditivity  renaining  in  individual  analyses 
of  sone  subjects.  Figure  2  suggests  that  they  would  have  obtained  siiiilar 
results  with  the  other  two  transfer nations,  which  are  very  close  to  the  ADDALS 
values  ex:ept  for  the  lowest  three  points.  The  correlations  between  ADDALS 
values  and  the  other  two  transfer nations  are  .997  for  the  transfer nation 
inplied  by  Tukey's  nodel  and  .995  for  the  power  transfer nation  sug^sted  by 
Ansconbe  and  Tukey  (1963).  While  these  hi^  correlations  suggest  that  the 
results  would  cone  out  the  sane,  no  natter  which  of  the  three  transfornatioi' 
was  enployed,  it  would  loe  wrong  to  conclude  that  there  are  no  neaningful 
differences  between  the  transfer nations.  For  reasons  to  be  described 
presently,  the  nodified  log  transfer nation  and  the  power  transfer nation 
do  yield  alnost  literally  identical  results  with  data  falling  in  the  rela¬ 
tively  restricted  range  of  the  present  data.  However,  even  though  these 
two  transfer  nations  correlate  very  hi^ly  with  the  ADDALS  transfer  nation, 
they  differ  from  it  systenatically,  tending  to  exceed  the  ADDALS  values  at 
the  extrenes  while  undershooting  them  in  the  middle  range.  Anderson  and 
Shanteau  (1977)  enphasize  the  inadequacy  of  correlation  as  a  test  of  agreement 
between  theory  and  prediction.  These  data  illustrate  their  point  a^in. 

The  general  monotone  transfornation  differs  nost  markedly  from  the  other 
two  with  respect  to  smoothness.  While  the  overall  increasing  trend  is  steady, 
it  is  interspersed  with  minor  fits  and  starts.  For  ejanple,  note  the  little 
steps  in  the  curve  for  values  of  the  transfer ned  response  just  above  2.00, 

4.00,  and  5.10,  and  the  little  junp  that  occurs  at  about  4.80. 

As  noted  above,  vrtien  the  ratio  of  the  largest  data  point  to  the  snallest 
is  not  too  big,  the  transfer nations  and  log  { Xy  -  Xy  +  1)  produce  very 
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similar  results  with  re^rd  to  additivity.  Tukey  (1957)  defines  a  neasure 
of  the  strength  of  a  transformation  which  helps  to  ejplain  why  this  is  so. 
If  z  is  the  transfer nation,  Tukey's  neasure  of  strength,  as  one  gaes  from 
Yl  to  y2  in  the  original  netric,  is 


s{yi,y2) 


log  72  -  log  y^^ 


(7) 


The  rationale  for  this  neasure  is  a  bit  subtle;  see  Tukey  (1957)  for  details. 
For  power  transfer nations  yP  the  strength  is  1-p,  independent  of  the  choice 
of  y^^  and  .  Also,  the  strength  of  the  transfer  nation  log  y  is  1,  inde¬ 
pendent  of  y^  and  ^2  •  Hence,  the  sequence  of  transformations 

•  •  •  /  Y/  Y^»  Yf  Y~^/  Y‘‘^»  •  •  • 


is  equally  spaced  in  terms  of  this  neasure.  The  strength  of  the  nore  general 
transfer nation 

^  _  (y  +  c)P  ,  p  ^  0  , 
log(y  +  c)  ,  p  =  0  , 

depends  on  y^  and  y^  .  However,  power  series  expansions  cf  log applied 
to  Equation  7  show  that  a  gaod  first  appro xi nation  to  the  strength  is  given  by 

S  =  (1-p)  ..Y  '  (8) 

y  +  c 

where  y  is  the  midpoint  of  y^  and  y^  .  inlying  this  approxination  to 

log  (y  -  p  +  1/X)  at  y  =  p  yields 

S  =  (1  -  0)  P 

p  +  (1/A  -  p) 


=  Ap  , 
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which  is  t±ie  sane  as  t±ie  strength  of  the  transfornation  .  Thus,  it 

is  not  surprising  that  log  (y  -  vi  +  1/A)  and  yield  similar  results 

close  to  the  nean  of  the  original  data.  If  the  range  of  the  data  is  not 
great,  one  never  gets  fcir  enou^  from  the  nean  for  the  difference  in  strength 
to  become  noteworthy. 

Discussion 

Researchers  soitEtimss  object  to  the  use  of  transfer nations  because  they 
distort  conparative  differences  between  rows  as  one  goes  from  one  colunn  to 
another.  There  are  two  ways  one  nig^t  respond  to  this  objection.  One  is 
to  note  that  the  original  netric  chosen  to  represent  the  data  nay  be  inappro¬ 
priate,  and  to  insist  on  its  use  nay  cause  the  investi^tor  to  miss  sigiifi- 
cant  regularities  in  his  or  her  data.  In  such  situations,  a  sinple  power 
transfornation  of  the  original  data,  such  as  the  reciprocal  or  square  root, 
nay  be  as  naaningful  as  the  original  netric,  while  leading  to  nore  inter- 
pretable  patterns  of  results. 

A  second  reply  ni^t  be  that  even  if  the  original  netric  is  quite 
appre^riate,  the  usual  statistical  analyses  applied  to  the  data  e^ressed 
in  this  netric  ni^t  lead  to  erroneous  ccaiclusions.  This  is  particularly 
so  if  one  detects  sigiificant  nonadditivity  in  the  original  data  array, 
concludes  that  the  differences  produced  by  one  factor  are  inconsistent  over 
levels  of  the  other  factor,  and  leaves  it  at  that.  Additivity  refers  to 
the  invariance  of  differences  between  row  elements  as  one  gxs  from  colunn 
to  colunn  in  the  array.  Other  invariant  relationships  ndgit  hold,  though 
additivity  does  not,  and  seeking  a  transfornation  to  attain  additivity  night 
lead  one  to  notice  thenu  For  ejanple,  when  Tukey's  nodel  holds,  ratios  of 
intervals  between  rows  are  invariant  from  colunn  to  colunn.  Consider  any 
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four  rows,  and  let  their  row  indices  be  i  =  1,  2,  3,  4.  Then  it  follows 
from  Equation  1  that 


^ij  -  ^2i 

^3j  ~  ^4j  (013  -  04)  XSj 


independent  of  j.  It  was  noted  earlier  that  appropriate  graphical  represen¬ 
tation  of  the  original  data  can  be  useful  in  determining  if  Tukey's  ntxiel 
holds.  If  it  appears  that  it  holds  for  a  given  set  of  data,  it  nakes  sense 
to  apply  the  transfer nation  inplied  by  the  nodel.  Then  a  norc  rigorous  test 
of  the  fit  of  the  nodel  can  be  carried  out  by  testing  the  transforned  data 
for  additivity. 

The  force  of  these  replies  depends  on  the  sinplicity,  or  nonarbitrariness, 
of  the  transformation  used,  v^ich  in  turn  depends  on  the  details  of  the  situ¬ 
ation.  Thus,  when  the  transformation  proposed  in  this  paper  or  a  sinple 
pewer  transformation  will  work,  they  would  seem  to  be  preferable  to  the  general 
nonotone  transf or nations.  It  was  noted  above  that  the  latter  will  usually 
have  a  lack  of  siroothness  which  makes  them  appear  somevhat  arbitrary.  However, 
one  should  not  hesitate  to  enploy  the  nore  cxonplicated  algorithms  in  situations 
where  the  sinple  approaches  fail. 

In  choosing  hDetween  the  power  transformation  and  the  nodified  lo^rithnic 
transformation  proposed  in  this  paper  one  ni^t  be  guided  by  substantive  con¬ 
siderations.  The  ejponent  of  the  inplied  power  transformation,  or  the  focal 
point  of  the  nodified  logarithmic  transformation  nay  nake  theoretical  sense. 

If  not,  the  choice  would  seem  to  be  a  natter  of  taste.  In  that  case,  the 
author  prefers  the  transfer nation  inplicit  in  the  nodel  used  to  assess  the 
nonadditivity. 
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